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Abstract. We analyze a path-lifting algorithm for finding an approximate zero of a complex 
polynomial, and show that for any polynomial with distinct roots in the unit disk, the average number 
of iterates this algorithm requires is universally bounded by a constant times the log of the condition 
number. In particular, this bound is independent of the degree d of the polynomial. The average is 
taken over initial values z with |z| = 1 + using uniform measure. 



A point is an approximate zero for a function f{z) if it converges quadratically to a zero 
under Newton's method. The notion of an approximate zero was introduced by Smale in [Sm81]. 
A sufficient condition to determine if is an approximate zero for / using only evaluation of /(z*) 
and its derivatives at was developed by Kim ([K85], [K88]); this condition was sharpened and 
extended to apply also to systems of polynomials by Smale [Sm86]. Nowadays, this approach is 
commonly called a-theory. We will use the Kim-Smale criterion to locate approximate zeros; see 
Theorem 3.2. 

Depending on the context of the problem, the goal might be to produce a point z so that \f{z) \ < 
e, or one might desire that |l — CI < £ where /(C) = 0. In either case, such a solution is called 
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an £-root. Notice that an approximate zero may not be an £-root. However, such a point 
will converge to an e-zero quadratically. Consequently, locating an approximate zero resolves the 
question of producing an £-root. See Def. 3.1 for the specific definition. 

In this paper, we discuss the use of a path-lifting method (which we call the a-step method, 
a variation of the algorithm developed in [K85] and [K88]) to locate an approximate zero for 
a complex polynomial /(z), and show that for any polynomial /, the average number of steps 
required by the algorithm is universally bounded, independent of the degree of /, where the average 
is taken over the starting points for the method. In fact, the average cost depends on the average 
of the logarithms of several of the critical values of / (this, in turn, is less than the logarithm 
of the condition number; see Remark 3.8). We note that the results of [K88] imply that for any 
polynomial, this method converges except on a finite set of starting values. 

More precisely, we have the following. 

Theorem 1.1 (Main Theorem). Let f : C ^ C be a polynomial with distinct roots Q in the unit 
disk. There is a constant Af, independent of the degree of f, so that the average number of steps 
required by the a- step algorithm to locate an approximate zero for f is no more than 

67(A/+13.1), 

where the average is taken over starting values on the circle of radius l + l/d. The constant Af is 
the average of the logarithms of the radius of convergence of f^^ at the roots Q. 

The cost of each step of the algorithm is dominated by the calculation of CCf{z)- Since this can 
be done with ff^dXo^ d) arithmetic operations (see [BM], for example), we have the following. 

Corollary 1.2. The average arithmetic complexity of locating an approximate zero for f by the 
a-step algorithm is G [AfdXo^ d), where the average is taken over starting values on the circle 
of radius 1 + 1/J. 



In path-lifting methods, it is useful to distinguish between the domain and range, so we have 

f ■ ^source ^ ^target- 

To implement the method, we choose a path 7 in the target space (typically a segment connecting an 
initial point wq = f{zo) to zero) and attempt to lift it back to the source space via a branch of /^^ In 
this form, such methods were introduced by Shub and Smale (see, for example [SS86] or [Sm85]), 
although one could argue (as Smale points out in [Sm81]) that in some sense this idea goes back 
to Gauss. See [Ren] and the references therein, as well as [KS]. The series [SS93a, SS93b, SS93c, 
SS96, SS94, Sh07, BS] discusses related methods for systems of polynomial equations. A survey 
of complexity results for solving polynomial equations in one variable can be found in [Pa]; see 
also [B08]. 

The difficulty of computing a local branch of along a path 7 in the target space is related to 
how close 7 comes to a critical value of /. However, not all critical values of / are relevant: only 
those which correspond to a critical point of / lying near the particular branch of f^^iy) have any 
impact. Consequently, it is useful to factor / through the (branched) Riemann surface ^ for f^\ 
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SO that we have 




where / is a diffeomorphism except at the critical points of /, and n is the projection map. With 
this viewpoint, the path 7 that we lift back to Csource lies in rY, and the troublesome points now are 
the branch points of y. 

One ingredient important to our analysis is understanding the Voronoi decomposition of y 
relative to the branch points. That is, for each branch point v of y, the Voronoi domain Vor(v) 
is the set of points in y which are closer to v than any other branch point of y (using the metric 
lifted via n). We show in §4 that the projection map n restricted to any single Vor(v) is at most 
(m + l)-to-one, where m is the multiplicity of the critical point of / corresponding to v (hence the 
projection is generically at most 2-to-one). Because the number of steps required for a path-lifting 
algorithm is related to the number of critical values the lifted path comes near, this result enables 
us to count the number of relevant critical values for a typical path. 

The path-lifting algorithm works as follows: We choose an initial point zo just outside the disk 
known to contain all of the roots, and let wo = fizo) ■ We then attempt to continue the branch 
of /^^ which has /^^(wq) = zq along the segment from wq to by choosing a suitable sequence 
Wn along this ray, together with approximations Zn such that ~ w,j. (Note that specifying a 
pair (z, w) such that w = f{z) is equivalent to specifying a point in y, so in practice our rays live 
naturally in y .) The process stops when a point Zn is detected to be an approximate zero. 

Given a pair (zn,>v„), the point Wn+\ is chosen to ensure that Zn is an approximate zero for 
f{z) — Wn+i ■ The tool we use to detect approximate zeros is the Kim-Smale a-function: if CCf{z) < 
0.1307, then z is an approximate zero. 

The paper is organized as follows. In section 2, we set out notation and preliminary notions. 
Section 3 describes the path-lifting algorithm explicitly. In section 4, we discus the branched 
surface y and the corresponding Voronoi partition; this section may be of interest independent to 
the question of root-finding. Section 5 computes several estimates related to how the polynomial 
/ behaves on the initial circle. In section 6, we calculate a lower bound on how far apart the points 
Wn and w„+i can be, and in §7 bound the number of steps needed for the algorithm to locate an 
approximate zero from a given starting point zq. This bound depends on the log of the angle /(zo) 
makes with the relevant critical values of /. 

Using these results, we can average the bound from §7 over all starting points; this is done in 
Section 8, which proves the main theorem. We conclude in Section 9 with some remarks. 

2. Preliminaries 
We will use the following general notions and notations throughout. 

An open disk of radius r > centered around z G C is denoted by Dr{z) . 

The function Arg denotes the argument of a complex number (in the interval {—n.n]). 
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The ray 4 C C of a point z G C \ {0} is 

4 = (0,oo) -2 = {w G C I Argw = Argz} , 

and the slit of this point is the part of the ray extending outward from z, that is 

o-z= [l,°°)-z= {wg4 I |w| > |z|} 

Finally, we will introduce some notation used when dealing with the Newton flow. Let / : C ^ C 
be a polynomial, and denote the critical points of / by 

^/ = {^l/W=0}. 
Consider the following vector field on C, 

m 



fizY 



The corresponding flow is called the Newton flow. This vector field blows up near the critical 
points of /. By rescaling the length of the vector by |^, the critical points of / become 
well-defined singular points of the rescaled vector field. This rescaled vector field is the gradient 
vector field z = — V|/(z)p; the solution curves of the former coincide with the latter, and we will 
use the two interchangably. The equilibria of the Newton flow are exactly the roots and critical 
points of /. Each root ^ is a sink; we shall denote its basin of attraction by Basin(^). Critical 
points are saddles for the flow. Furthermore, we can extend the flow to infinity, which is the only 
source. Each boundary component of Basin(^) contains critical points c G each critical point c 
has an unstable orbit leaving from c and converging to ^. This unstable orbit is a separatrix of c and 
will be denoted by 7^.. Generically, there is a unique critical point in each boundary component; in 
the degenerate cases, there could be saddle connections resulting in multiple critical points on one 
boundary component. A general discussion regarding Newton flows can be found in [STW] and 
[JJT]. See also Figure 4.2. 

We note that for each root ^, / is a biholomorphic map between Basin(Q and C \ \Jaf(^f.y 
where the union is taken over the critical points c which lie on the boundary of Basin(^). 

It is important to note that if is a solution curve for the Newton flow, f{^t) lies along a ray. 

Throughout the paper, we will consider polynomials / G <^/(l), that is, / : C ^ C given by 

nz) = Uiz-Cj) with 101 <i, 

with distinct roots l^j. The set of roots of / will be denoted by 

^F = {Q\j=h....d}. 

The restriction to ^(1) is not severe; it can always be accomplished via an affine change of 
coordinates depending only on the coefficients of /; see [M], for example. 



We shall use the following standard result several times. 
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Lemma 2.1. (Koebe Lemma) Let g : Z)^(O) —>Cbe univalent with g{0) = and g'{0) = 1. For 
z G Dr{0) with s = \z\/r, we have 



(2.1) 

and 
(2.2) 

Consequently, 
(2.3) 



< 18 < 



(1-5) = 



<\8(z)\<\z\ 



D,/,{0)CgiD,{0)). 



Remark 2.2. The last statement (2.3) is known as the Koebe ^-Lemma. The proof can be found in 
[Ko], [P]. 



3. The Path-Lifting Algorithm 

We now discuss explicitly the path-lifting algorithm that we will use to find an approximate zero 
ofan/G^rf(l). 

Definition 3.1. Let z„ G C be the iterate under Newton's method of the point zo G C, that is, 



Zn+l — Zn 



The point zq is called an approximate zero of / if 



for all n > 0. 



fjZn) 
fiZn] 



\Zn+l < I 2 



2"-l 



\zi —zo\ 



A sufficient condition for a point to be an approximate zero is developed in [K85] and [Sm86]. 
We will use the criterion formulated by Smale in [Sm86] to locate approximate zeros. It uses 
a : C \ ^ C defined by 



(3.1) 



a(z) — max 



fiz) 



fiz) 



f^^\z) 



M'{z) 



1 

7-1 



It is sometimes useful to use the related function 7(z) instead, where 



(3.2) 



y{z) = max 

7>1 



M'{z) 



1 



While we will primarily use a(z), we make use of y{z) in section 6. 

Theorem 3.2. [Sm86] There is a number Oq > 0.1307 such that if a{z) < OCq, the point z I 
approximate zero. 



IS an 
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Remark 3.3. The number Oq is given in [Sm86] and in many places throughout the literature 
as Oo ~ 0.130707. However, this specific value is very likely the result of a typographic er- 
ror in the fifth decimal place. Smale shows in [Sm86] that Oq is a solution to the equation 

(2r^ — 4r+ 1)^ — 2r = 0; the relevant root of this equation is 0.13071694 There have been 

subsequent improvements to this constant (see [WH] or [WZ], for example), but 0.1307 suffices 
for our purposes. 

Remark 3.4. Calculation of a{z) requires the ability to evaluate all derivatives of / at a z. In 
some situations, this is not possible; for example, if / is defined as an n-fold composition of some 
other function g, calculation of / and /' in terms of g and g' is simple, but calculating even f" is 
impractical. However, evaluation of higher derivatives may be avoided using the bound [Sm86]: 

11/11(9^(1^1))^ 



\fiz)\-Wi\z\) 



where (Pd{z) = T.z', f{z) = Y^aiz\ and ||/|| = max|<3,|. Alternatively, an adaptive version of the 
algorithm can be used which doesn't use (x: see remark 2 in §9. 

We shall analyze the following algorithm to find an approximate zero for / G ^d{^)- 

The a-Step Path Lifting Algorithm 

Step 0: Choose zqEC with |zo| = 1 + ^- Let 

wo = f{zo) and w 



\wo\ 

Step 1: Stop if a(z„) < 0. 1307; z„ is an approximate zero for /. 



Step 2: Let 



and 



1 \nzn)\ 

15 a{zn) 

_ _Wn+\-f{,Zn) 
Zn-'rX Zn 



f'iZn) ■ 

Continue with Step 1 . 

We shall sometimes refer to the points w,, generated by the algorithm above as guide points. 



Remark 3.5. If zo £ Basin(^) then the algorithm will terminate with an approximate zero for ^. 
There may be some values of n for which z„ ^ Basin(^). However, there exists a neighborhood 
i7 C C of the ray of wq which contains all f{zn) and on which there exists a univalent inverse 
branch of / mapping wq to zq. Denote this inverse branch by /,^^ :U — C. See Figure 3.1. 

For every zero ^ G 3ff consider the closest critical value to = f{Q 

= min ^ I /(c) I where ^f{Q = ^fn Basin(C). 
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ZQ 



Zn 



Zlr 




Basin(^i) 




Figure 3.1. While zq £ Basin((^i), we have Z3 G Basin((^2)- However, as noted in re- 
mark 3.5, there is a neighborhood U of the ray on which the same branch of the inverse 
contains all the z„. 



Remark 3.6. Note that is the radius of convergence off ^ at ^, and is the distance in the surface 
y between f{Q and the nearest branch point of y. 



For any polynomial /, we define 



Remark 3.7. Notice that Kf < 00 [f and only if 3ff D^f = Q). This holds generically for polynomials 
/, and Kf = 00 exactly when / has a multiple zero. Root-finding problems for which there is a 
multiple zero are typically called ill-conditioned or ill-posed. 

Remark 3.8. Kf is related to the condition number jAf, which is the reciprocal of the distance 
between / and an ill-conditioned problem (see [SS93b] and [Dem], for example). 

It is common to use to denote the condition number considering only those problems for 

which ^ is a root. Note that 



PC 



and 



1 

11 f > mm — . 



since the map f{z) 



has a multiple root at ^, where is the appropriate critical value with 



Let #/(z) be the number of steps the a-step algorithm needs to get to an approximate zero when 
it begins at z, which we refer to as the cost of the algorithm at z- The average number of steps is 
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denoted by 



1 

\V ■ d 

where we use starting points on the circle of radius l + \/d. 
Our main theorem (Thm. 1.1) says that if / G ^(1), we have 

./ / 2|loe^f| 

#/<67( 13.1 + ■ ' ^ ^' 




d 

The proof of the theorem will be prepared in the next sections, and is completed in section 8. 

Remark 3 . 9. We analyze the algorithm using starting points z G C with \z\ — l+C/d. Taking C—l 
yields the bound stated above. 

Remark 3.10. One can introduce a measure of difficulty Kf j^ corresponding to a given zero ^ G ^f. 
Proposition 7.1 gives an estimate for the time needed to reach an approximate zero for C, starting 
in zo in terms of ^/,^- 

4. The Voronoi Partition in the Branched Cover 

Given a polynomial / : C ^ C of degree d, recall that we denote its critical points by = 
{z I f'{z) = 0}. For any such /, we can express it as a composition f = TZo f. 



C 



where / is a diffeomorphism except on (where it is merely a bijection), and ;r is a J-fold 
branched cover, ramified at points of ^f. The metric on y, denoted by dist, is such that ;r is a 
local isometry away from points in ^ = /(^). 
The multiplicity of a critical point c G ^ is 

mc = min|yt|/'^+i)(c) ^O}. 

Notice that 

^ mc = d-l. 

The points in ^ are called critical values in y, and we define the multiplicity m^ of v = /(c) G Yf 
to be the multiplicity of c. 

We note that for each root ^ G ^f, 

;r:/(Basin(C))^C\ |J a, 

is an isometry (where 1^(0 = /(^/(C)))- 
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The Voronoi domain of a point v G ^ is 

Vor(v) = {m e ^ I dist(M,v) < dist(a, v'), Vv' G r/} . 
Recall that an open disk of radius r > around u E y is denoted by Dr{u). 

Lemma 4.1. u E Vor(v) if and only if n : — > D|„_^,|(;r(M)) is an isometry. In particular, 
ifuE Vor(v) then 

D\,,_,\{u)nyf = %. 

Proof. If M e Vor(v) then Z)|,,_v| (w) n ^ = 0. Thus, ;r is a local isometry on all of Z)|j,_v| ^i^d 
in particular, ;r is a global isometry on this disk. Conversely, If n is an isometry on all of /)|„_v| 
there can be no critical values in the disk, and so u eVor(v). □ 




Figure 4.1. The surface y for a degree 7 polynomial, viewed as a stack of seven slit 
planes. Each sheet is /(Basin(^;)) for the root i^,-, and is sUt along a,,^ (dashed lines), which 
begin at the branch points Vj G Yf (indicated by crosses). The central ellipses indicate 
71^^(0). In the figure, Oy. is labeled as Oj. The Voronoi domains of each of the vj are 
indicated, with boundaries marked by heavy black lines. Note that while Vor(vj) may enter 
many sheets, the projection is at most 2-to-l, as in Cor 4.5. See also Figure 4.2. 
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Let Ml, M2 G 5^ . If the line segment \%{u\),ii{u2)\ C C has a lift in 5^ which connects u\ with 
M2, we denote this lifted line segment by [wi , M2|- Observe that many pairs wi , M2 do not have such 
a connecting line segment. In this case we write [mi,M2| = 0. When [mi,M2| is nonempty, we say 
that Ml is visible from U2 in S^. Also observe, if v G then 

[m,v|7^0 forallMG Vor(v). 

We can form the visibility graph for 5^ as follows. The vertices of the graph are the critical 
values 'Yf, and there is an edge from v to w if and only if |v, w| is non-empty. We can identify the 
visibility graph with the subset of y given by 

^= U hA- 

Recall that / is a homeomorphism. Hence, /^^(^f) is well-defined, so we can also view $f as a 
graph immersed in C, with the critical points of / as vertices. 




-1.0 -0.5 0.0 0.5 1.0 



Figure 4.2. The Voronoi regions of Fig. 4.1 are shown in the source space. The roots of 
/ are indicated by circles, the critical points by crosses. The Newton flow is indicated by the 
small arrows, and the dashed lines are the boundaries of the basins of each root (each such 
boundary contains a unique critical point). /^^(Vor(vy)) is shown for each critical point 
Cj G (bounded by the heavy lines). In the figure, /^^(Vor(vy)) is labeled by Vor(cj). 
The visibility graph is also shown. 
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We shall say that a critical value w G ^ is a neighbor ofv if there is an edge between w and v in 
^. Denote the set of neighbors of v by 

For each edge of ^, we can define the lines 

^•,w = {ue y \ dist(M,w) = dist(M,v)} , 

which are the geodesies passing perpendicularly through the midpoint of each [v, w|. In the termi- 
nology of [VPRS] and [BV], each of the Lv,w is a mediatrix relative to the set Yf. 

Problem 4.2. Which abstract graphs can be realized as the visibility graph ^ of a polynomial? 

Lemma 4.3. For u e Vor(v) and w G ^ 

([M,v|\{M})nL,,,w = 0. 

Proof. According to Lemma 4.1, the metric on C =5^ is the usual Euclidean metric. This 

implies immediately that if 

then either w G £>|j,_v| (") or a G L^,y, see Figure 4.3. If w G £>|„_v| (")' it cannot be in 'Vf. □ 




Figure 4.3. By Lemma 4.3, if m G Vor(v), then [m,v| cannot cross Ly^„, since n is 
univalent on D|„_^,| (m). 



Lemma 4.3 can be used to describe the boundary of Voronoi domains. Specifically, for each 
V G Vor(v) is the connected component of 

which contains v. See Figures 4.1 and 4.2. 

Recall that the ray (.y C C of a point y G C \ {0} is the set of points which have the same 
argument as y. 
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If e y projects onto and [O, uj ^ 0, the geodesic starting at and containing [O, m] is the 

ray through u G y, which we denote by Observe that if fl ^ = then n : £u ^ ^k{u)j is a 
surjective isometry. 

Let y = 7t{u).lf iy n fi^^f) = 0, then 

n^^iy) = ly, U ly^ U ■ ■ ■ U fy^, 
where the points G ^ are the d different preimages of y. 
Proposition 4.4. Given v eYf and y G C \ /{"^f). Then 

card |/ 1 ly. n Vor(v) 7^ < m,, + 1. 

Furthermore, each £y. fl Vor(v) is a connected set. 

intersect Vor(v) , with v = /(c), c G Pick a point m,- in each of 

Ui G £^,.nVor(v). 



PrOO/ Suppose iy^ ,£y^,..., ly^^ 

these intersections, that is. 



K 



Di 



I j \ \ 




("1) \ \ , 


n{ui) 

K{Di)/ 





Figure 4.4. As proven in Proposition 4.4, the projection n is {my + 1) -to-one on Vor(v). 



Let Di = (a,). According to Lemma 4.1, we know that K . Di Tt{Di) is an isometry. 

Let Pi G be the perpendicular projection of v onto ly.^ and let p be the projection of /(c) = n{v) 
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onto £y. Then for all i < k, 

^ [v, c Di and ^ [;r(v) , ;?] c fl ;r(D,-) . 

i<k 

Hence, n : ^ Cis k-to-l in a neighborhood of v G with k < mc+l. 

The connectedness of fl Vor (v) follows from the triangle inequality. □ 

Corollary 4.5. Each projection n : Vor(v) C is at most {ruy + \ )-to-one. 

Let z G C. We'll say that a critical point c E'lof influences the orbit of z if the segment [O, /(z)] 
passes through Vor (/(c)). 

We are interested in the critical points which influence the starting values for our algorithm, and, 
conversely, the starting values which are influenced by a given critical point. 

Definition 4.6. For starting values z on the circle of radius r, we define the following sets: 

^ = {(r,c) G [0, 1] X I [O,/(re2^'0|n Vor(/(c)) ^ ©} 

= {c G ^/ I (f,c) G J] = e [0, 1] I (?,c) G ^} 

Notice that, for z = re^^" fixed, we have c E J^t precisely when, for some y G ^/■(^), ^|/(c)->| (3^) 
is the largest ball on which f^^ is defined. Similarly, for this pair c), we also have t G J^c. 

5. The Behavior of / on the Initial Circle 

Consider the function a,- : [0, 1) — R defined by 

ar{t)=Argf{re^^"), 

with r > 0. We can easily bound the rate of change of ar{t); while elementary, these bounds play 
a crucial role for us. 

Lemma 5.1. Let r > 1. Then for all t G [0, 1), we have 

r da,- r 
Ind < —^<2nd- 



r+l dt r-1 
Proof. Let z = re^'^'^ with r > 1. Since \Q < 1, we have |- EDi{0) = {w | |w| < j]. Acalculation 



shows 



dar ^ d , 9^,v, ^ „ z ^ „ 1 



dt dt J=i^-W ;til-wA 

For each root Q, we have 



r 1 r 
< Re — < 



r+l- l-^i/z-r-l 
Summing this inequality over the d roots and applying it to equation 5.1 gives the desired result. 

□ 
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Remark 5.2. The estimates in Lemma 5.1 are sharp. 
Corollary 5.3. Let r = l + l/d, and define 

{fire'') , 
te [0,1) I |Arg^4 — ^1 KAJorallce'^f 
f{c) 

Then 

^ 2A d-\ 
measure [B A) < 



7t d 

Remark 5.4. Let Ga be the complement of Ba. For each f G Ga, f~J will be analytic in a cone 

{w||Arg(w)-Arg(/K))|<A}, 

and consequently such t correspond to "good starting points" for a path-lifting algorithm. 

This is essentially Condition @ of [Sm85] and [SS86], with A = n/\2. It is shown in those 
papers (Prop. 2) that has measure at least 1/6 if one also takes r = 3/2 (which increases the 
number of steps by approximately Jlog(3/2)). The corollary 5.3 gives the measure of G;j/j2 to be 
at least 5/6. 

Lemma 5.5. Let c be a critical point on the boundary o/Basin(^), and let 7^ be the solution to the 
Newton flow emanating from c whose interior lies in Basin(^). Then ifr > I, 7^. fl 5^ = 0. 

Proof. Note that the Newton flow points inward on 5^ = {z | |z| = r} for r > 1, which follows from 
the observation that 

f{z) _ 1 

This immediately implies Lemma 5.5. 

To see this, note that since |z| > 1 and |^,| < 1, the vectors z— Ci all lie in the half-plane 
which contains D, . Consequently, their inverses and hence their sum E 1 /(^ ~ C;) lie in the com- 
plementary half-plane. Inverting again gives f{z)/f'{z) E J^, as required. □ 

We can now use the previous lemmas to estimate the width of the "necks" of Basin(Q. 



Lemma 5.6. Let r > I, ^ E 3ff, and let ybe a connected component ofS,- fl Basin(^). Then 

w N 1 . dar ^ ^ 

length (7) mm < 271 r 

2tz dt 



Proof. Let B C Basin( ^) be a boundary component of Basin( ^) with 70 5 7^ 0. Let c G fl 5 be 
the critical point which has an orbit 7^. C Basin(^) of the Newton flow starting at c and ending at 

Observe that 

/(7,U5) = (0,oo)./(c)=£^(,), 
the ray through /(c) . From the definition of 7 and Lemma 5.5 we get int(7) fl (5 U 7^.) = 0. Hence, 

Arg(/(int(7)))nArg(/(c))=0, 

that is, the image of 7 cannot make more than a full turn in the target space. The Lemma follows. 

□ 
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The following corollary follows immediately from the proof. 

Corollary 5.7. Let z\ and zi satisfy \z\ | = |z2| = with r > 1, and suppose also that they lie in the 
same connected component ofSr H Basin( ^). Then 

|Arg/(zi)-Arg/(z2)|<2;r. 



In the sequel we will consider integrals over the circle 5^ = {z G C | |z| = r}, which, for all r > 0, 
carries Lebesgue measure with unit mass. 

Lemma 5.8. Let r > and \ C,\< r then 

[\og\re^''" -ndt = \ogr. 
Jo 



Proof. Define 



S(C)= f\og\re^''" - Qdt 
Jo 

= / Re(log(z-C))-:^- 
JSr 2;r« z 

= Re-^ / log(z-C)--. 

27tl JSr Z 



Note that 



dS 



Re 



1 dz 



2ni JsrZ — C z 



= -Re 



1 



1/C 1/C 



Hence, 



2ni Jsr \z-^ z 
5(C) =5(0) =logr. 



dz^O. 



Corollary 5.9. Let f{z) = Uj=i {z - Cj)> with \^j\< r. Then 

[\og\fire^''")\dt = d\ogr. 
Jo 

Proof. 

/■I 









/'log 


n(re2--^-0) 


dt 


Jo 


7=1 





£ [\og\re^^''-^j\dt = d\ogr, 



where the last equality follows from Lemma 5.8. 

Remark 5.10. Notice that if r = 1 + 1/J, we have Jlogr < 1. 



□ 



□ 
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Problem 5.11. The previous corollary shows that the average value of log |/(z) | on Sr is d log r. Is 
there a constant Cr independent of d so that 

measure [t \ log \f{re^^'') \ < dlogr] > c,-? 



We now establish a lower bound on |wo| = |/(zo)| for starting values zo on the circle Sr with 
r > 1 . We shall use this in Lemma 6.9 to give a bound on the size of our final point w^. 

Proposition 5.12. Let z G Basin(^) with |z| = r > 1. There exists Sr < I such that 

\f{z)\>Sr-p^, 

where pi^ is the radius of convergence of the branch of f^^ taking to C,. 
Ifr> 1 + Sr = l- Otherwise, for r = 1 + Sr is the smallest positive solution of 

C = %K- 



(1-5)2- 

Remark 5.13. For < C < 2k, we have < 5^ < 3 - \/8. For C = 1, we have Sr ^ 0.0369 > ^. 

Proof. We will assume, without loss of generality, that the .x:-axis is aligned along ^. Let / be the 
radius of the largest disk centered at C, on which / is univalent, that is, 

A(C)cri(^Pc(0))cBasin(C). 

The Koebe ^-Lemma (Lemma 2.1), implies 

PC 



(5.2) / > 



\f'{Q\ 4 




Figure 5.1. Using the Koebe Lemma to calculate a lower bound on \ f{z) \ for z on Sr, in 
Proposition 5.12. 
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Let z be a point in Basin(^) with |z| = r. It is our goal to estimate |z — C|. First notice that if 
/ < |z — the desired resuk follows immediately from (5.2). Thus, we need only consider the 
case when / > |z — CI- 

This means that we can assume that z G Di{Q, and since |z| > 1, there is a point A e S\f]Di{Q; 
let ^ be the angle of the sector connecting 0,A, and 1. See Figure 5.1. 
The Koebe Lemma gives an upper bound on |z — ^ | ; for 5 < 

We now look for a lower bound on |z — C | by estimating for z G 5^ fl D/ ( C) . Notice that 



Z=y^C2-2Ccos(0) + l, 

since 

(cos^ - Q^ + s'm^^ = f 

where (cos((^), sin(^)) is the coordinate of the point A on 5/(C) flSi. 
From Corollary 5.7, we have 

Arg(/(A))-Arg(/(A))<2;r, 
and by Lemma 5. 1 (which bounds the radial derivative of /), we have 

TZ r+1 2tz 

d) = Arg(A) < < — , for all r > 1 . 

a r a 



Since r = 1 + we have 

\z 



d 

C r 1 I C 



/ a/?'-2Ccos(<^)) + 1 /^2_2Ccos(f ) + 1 



Notice that for < C < 2;r and \ C, \ < 1, the above expression is minimized when C = 1- Hence, 
we have 



for all d. This gives us 



^ /l-2cos(f ) + l 2^ 



In - In 4|/C)| 

This, together with the upper bound estimate (5.3), gives the lower bound on s as the solution to 



1% 4|/(C)|- (1-^)2 1/(01 
This simplifies to 

C = 8;r— 

{\-sY 

as desired. □ 
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6. The Size oe the Step 

Each iterate of the algorithm described in § 3 is guided by the values w„. The difference between 
Wn+i and Wn is called the rf^-jump and is denoted by 



where /„ = f{zn) and a„ = (x{zn)- To be able to control the algorithm we have to carefully adjust 
the range of the coefficient A. In this section we will explain the choice A = j^- 

If / were linear, the algorithm would follow w„ exactly, and /„ = w„. When the degree of / is 
at least 2, there will be a small error 

= \fn-Wn\- 

While the algorithm is described in terms of Csource (the Zn) and Ctarget ( fizn) and the w„), it is 
more straightforward to think of it in terms of the branched surface y. 

Let r„ > be maximal such that 

/^i:D,„(w.)-t/ 

is univalent, where U isa neighborhood of Zn- This is the distance between Wn G ^ and the critical 
value V EYf for which vv,, G Vor(v) . Also, let 7?„ > be maximal such that 

is univalent, where V is a neighborhood of z„. Note that /„ could be in Vor(v') for a critical value 
different from that used for iv„; in this case, we still use Rn = \v' — fn\. 




Figure 6.1. The various notations used througout this section, shown in the target space. 
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The following Proposition is a crucial ingredient for the estimate of the average cost. 
Proposition 6.1. 



Jn > 



1 

66 



We need to do a little bit of work before proving this proposition. Let 

f'n fn 

£n = Zn+l - Zn and h„ = -{Zn+l - Zn) " ^ = -£,1 " -J- ■ 

Jn Jn 

We use = f'{zn), fn = f'izn), and fn^ = f^-^\zn) as notation for the derivatives of / at z„. 
Lemma 6.2. If \ Unhn \ < 1 then 

Ctnhji 



^n+l = \fi+l -Wn+l \ < \hnf„\ 



1 1 CCjihfi I 



Proof. Note that since 

Wn+\ — fn 



Zn+l — Zn 



f 
J n 



we have w„+i = fn- (z„+i - = ( 1 - hn)fn- 



Thus, 



Sn+l = \fn+l -{l-hn)fn\ = \f{Zn + £n) " (1 | 

fn+fn£n + ^£^ + ----fn+hnfn 

Z^£2 I Z^^gS , 

2! " 3! 



\hnfn\ 



< \hnfn\ 



f" r 

Jn_ I Jn 
r;t-n-t 



(3) 



£ + 



2lfl, 3lf,[ 



fn fn 2 

CCn ~r £n C^n r £n 
Jn Jn 



< \Kfn\ ■ \ anhn + {anhnf + 
^ \ i r \ I ^nhn I 



The proof of the following can be found in [BCSS] (Lemma 8.2b and Prop 8.3b). Here = 7 



is as in equation (3.2); thus a„ 



Jn. 



Lemma 6.3. Let Un = OCnhn and V^(m) = 1 — 4m + 2u^. Then ifun< 1 — 1/ V2, we have 



f 

J n 



fi+l 



< 



(l-Un) 
W{Un) 



and 



Yn-^l 



< 



1 



Yn (1-M„)V/(m,i 
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Remark 6.4. In [BCSS], m„ is defined as {z„+\ —z„)Yn- We use 

, Wn+l—fn ( .fn 

Jn In 

and so our usage and that of [BCSS] agree. 

The proof of Proposition 6.1 will use induction. Given a choice for the positive numbers A and 
c we will use the following induction hypothesis 

(6.1) Ind„(A,c) : 5„<c-^. 

Yn 

The constants A,c > will be chosen later. The optimization process is better illustrated by 
using these general parameters instead our final value A = 1/15. 

Lemma 6.5. The induction hypothesis Indn{A,c) implies 

\Oinhn\ <A+C. 

Proof. Observe, 

\hnfn\ = \fn-W„+i\ 

< \Wn-Wn+i\ + \fn-Wn\ 
<Jn + 5n 

<A.^ + c-^ = (A + c)-^ 

□ 

So that we may apply Lemma 6.3, we impose the condition 

1 

A + c < 1 ^. 

V2 

By virtue of Lemma 6.5, this condition also ensures that the hypothesis of Lemma 6.2 is satisfied. 
Assume Ind„(A,c). We will prepare the induction step. From the proof of Lemma 6.5, we have 

\hnfn\<{A + c)^-^. 

Yn 



In Lemma 6.2, we obtained 

(Xfjhfi 



hnfn 



1 CCfihii 



If 



Yn 1 Ujj 



Consequently, a sufficient condition which implies Ind,i-|-i(A,c) is 



Yn 1-W« Yn+l 

or equivalently. 
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From Lemma 6.3, after simplification we get 

(A + c)^^--< 1. 

Since m„ <A + c and u/^f{u) increases monotonically for m G [0, 1 — 1/ \/2], we must have 

(A + c)2 1 
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(6.2) 



V/(A + c)2 c 



- < 1. 



We have established the following. 
Lemma 6.6. If(A,c) satisfies (6.2) then 

Indn{A,c) =^ Indn+i{A,c) 



The iterations are guided by the points w„ which decrease towards with jumps 



To optimize this convergence we need to find the largest A > for which there is a c > such 
that the pair (A,c) satisfies inequality (6.2). Numerics show that such solutions exist for A < 
0.0703039 < 1/14.22396; we can use A = 1/15 and c = 0.0158. Recall, 

/o 



5o = 0< 0.0158 



So Indo(-j^, 0.0158) holds. Then Lemma 6.6 implies that 



(6.3) 

holds for all n > 0. 



5„ < 0.0158 • 



The proof of Proposition 6.1 uses the following Lemma. This is essentially Corollary 4.3 of 
[K88]; the lower bound of ^ follows from the Extended Lowner's Theorem in [Sm81]. 



Lemma 6.7. 



l-Rn< — <i3-2V2)-Rn. 

4 an ^ 



With these lemmas in hand, we can now return to the proof of Prop. 6.1: 
Proof of Proposition 6.1. From Lemma 6.7, we get 

an - 15 4\fn\ 60 rn 

The radius of convergence at w,, is 
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where v„ is the critical value for which Wn G y lies in Vor(v„) . It might be that the radius at /„ is 
determined by another critical value, say 

Rn=\fn-V',\. 

Let r'„ = I w,j — v', | . Then we have 

rn < r'„ <\v'„- fn I + \fn " W„ \ 
= Rn + dn- 

In the case when v„ = v', we get the same estimate for r„. Notice, by using (6.3) and Lemma 6.7, 

<i?„ + 0.0158 -i^ 
0.0158 

<Rn + T—?^Rn 

3-2V2 



Consequently, we have 



< 1.09209 ■i?^. 



J > ^Jl > !jL 

1.09209-60 66' 



as desired. □ 
Lemma 6.8. If a„ > 0.1307, then 

|/„| < 1.1376 |w„| and | > 0.41982 

Proof. Observe, 
Hence, 

Now, 



l/J <w„ + 5„< |wj +0.0158-^. 

CC/i 

\fn\ < — 

0.0158 — 1-1376 \Wn\- 



k,+ il = l>v.|--- — 

, , / 1 1 



15 a„ -0.0158 

>k„|-(i-^- 



i I > 0.41982 |w„|. 

15 0.1307-0.0158; " ' ' 



□ 



Using these results, we can also obtain a relationship between the guide point wj^ where the 
algorithm terminates and p^, the norm of the closest critical value to 0. Recall that < 0.1307 
butttw 1 > 0.1307. 
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Lemma 6.9. For r > 1 + ^ 



Proof. From Proposition 5.12, we have 



Iwol >Sr-pt;>—. 

liwN = wq, the lemma holds trivially. 

If > 0, then Un-i > 0.1307 (and < 0.1307). 

From Lemma 6.7, we get 

|/v-i| > ^ ■ oJv-i --^v-i 

> 0.032675 -i^iv-i > 

P?-|/v-i| 



> 

- 31 



This last inequality follows from the triangle inequality: if v is the critical value with |v| = p^, then 
0, V, and fy-\ form a triangle with side lengths p^, Rn \, and |/v-i |- Rewriting the above yields 

(6.4) |/v-i|>^-Pf. 
We now apply Lemma 6.8 to obtain 

(6.5) Iwivl > 0.41982- |w;v-i I > 0.41982 ■ ^^^^ 

The lemma follows by combining equations (6.4) and (6.5). □ 



7. The Pointwise Cost 

In this section we will estimate the number #/^(zo) of iterates needed to find an approximate zero 
starting at zq- We need some preparation to be able to state the estimate. To simplify notation and 
without loss of generality, throughout this section we shall assume that lies along the positive 
real axis. Furthermore, we shall assume that no critical values of / lie along I^q. 

As before, let wq = f{zo) and the let the w„ be the guide points along E-^o as produced by the 
algorithm. Also let wq — f{zo) and vv„ be the corresponding points in the surface y, lying along 
the ray i^^. 

We divide i^Q into subintervals as follows: as noted in Proposition 4.4, for each v ^ 'ff the 
intersection of with Vor(v) will either be an interval or the empty set. Set = wq, and denote 
the first interval by [^o,^i] with corresponding critical value vi. In general, set 

[?;-i,?;] = Vor(vj)n?vv„. 

Let j8 = /3 (zo) denote the total number of such intervals. Note that for a point zq = re^^"^'' on our 
initial circle, we have 

/3(zo) =card^fo. 
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So that we may work in the target space C rather than in the surface =5^, we make the following 
observation. The projection n is an isometry in a neighborhood of iwQ, since Yf fl = 0. We 
define a set U {£wq) C ^ as 

where for yEC,y^ denotes the orthogonal projection of y onto i-^i^ (or its extension i-wo)- 

That is, for each critical point c, which influences the orbit of wq, we remove the ray perpendic- 
ular to £h,„ starting at the critical value /(c/). Lifting the result to y via the branch of taking 

£wo to iwo yields the set U {£wo)- 

Observe that n is an isometry on U{i^yf^), and furthermore, f/(^wo) contains i^^^ and a unique 
lift of each of the points /„ produced by the algorithm. Consequently, we have a well-defined 
correspondence between the target space C (minus finitely many rays) and a subset of ^ most 
relevant to the cu-step algorithm starting at zo- In what follows, we shall use the notation 

vor(v,) =;r(Vor(v,-)nt/(£wo)), 

and shall slightly abuse notation by using v, for f{ci). 

Note that the branch of /^^ which takes wq to zo is well-defined throughought all of 7r{U (iwo))\ 
in particular, it coincides with analytic continuation of /^^ along i^^. 




Let pj be the orthogonal projection of vj onto the ray ^h,) (or its extension, £-wq), and let xj 
\vj — Pj\. See Figure 7.1. Also, let Oj E {—7t, n] be the angle between vj and the ray that is, 

Oj = Arg{vj/wo). 
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Furthermore, use /3+(zo) to denote the number of 6j for which \6j\ < 7r/2 (or, equivalently, for 
which pj lies on i^^^). 

With this notation in hand, we can state an upper bound on the cost of finding an approximate 
zero starting from a point zq. 

Proposition 7.1. Let zo be an initial point for the a-step path lifting algorithm, with \zo\ > 1, let 
/ G ^(1), Wo = f{zo). Then the maximum number of steps required for the algorithm to produce 
an approximate zero starting from zq is 



where /3 + (zo) is the number of relevant critical values along with angle \ 0j\ < k/2, and w^ is 
the final " guide-point" for the algorithm. 

Remark 7.2. The above result may seem circular, since wm cannot be determined a priori. How- 
ever, Lemma 6.9 tells us that p^/S7 < \wf^\ < p^. 

In order to establish this proposition, we estimate the number of steps required to pass each 
Voronoi domain, and then sum over the /3 (zo) domains that passes through. 

If Wj and Wk are two guide points lying on £vio with k > j, we can define the rather trivial 
function Cost(wy,Wyt) = k — j. This measures the number of iterations required by the a-step 
algorithm beginning at a point zj near wj to obtain a point Zk near w^. We extend this function to 
all pairs of points yi and y2 lying on £vvo by linear interpolation. It is our goal in this section to 
estimate = Cost(wo5 >^v) where wn is an approximate zero. 

Rather than count the number of steps directly (which is possible, but tedious), instead we follow 
a suggestion of Mike Shub and integrate the reciprocal of the stepsize along 

Lemma 7.3. Letyi andy2 be two points ofi^^. Then 

py^ dy 

Cost{yuy2)<61 



wo- 



where ry = \y — v\ for each y E vor(v) fl i 

Proof. Recall that in section 6, we used 7„ to denote the n* jump, that is, 7„ = \ w„— Wn+\ \ where 
Wn is a guide point for the algorithm. Set J{w„) = Jn, and extend the function J{y) to all of ^h-o by 
linear interpolation. Now consider the differential equation along i^^ given by 

(7.1) ^ = -J(y) y(o) = ^o. 

dt 

Since J{y) is Lipschitz, the equation (7.1) has a unique solution. Observe that the points w„ are 
exactly the values given by using Euler's method with stepsize 1 to solve (7.1) numerically. 
Now consider instead the differential equation given by 

(7.2) 1^-^ .(0)^.0. 
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We wish to compare the solution of (7.2) to the Euler method for (7.1). We will show that for every 
y in any interval [w„+i,w„], we have ry/61 < J{y). Consequently, if (p{t) is the solution to (7.2) 
and = yi, ^{h) =y2, then we will have t2 — t\ > Cost(yi,y2)- 

To see that ry/61 < Jy for all y E [w,i+i,w„], we must examine a few cases. First, note that if 
y e vor(v/), we have 

ry = {y-pd^+xl 

Also, recall that by virtue of Prop. 6.1, we have J{w„) > rw„/66. 

First consider the case where the interval lies entirely in vor(v,). If w„+i > pi, then 

since ry is decreasing on the interval [p,,w„], we have J{y) > r^/66. If pj > will be 

nondecreasing. However, we can apply the triangle inequality (recalling that 7(w„) = w„ — w„+i) 
to see that 

ry < J{Wn) + rw„ < J{Wn) + 66J{Wn) , 

and so 7(w„) > ry/61 for ally in the interval. 

In the case where the interval intersects more than one Voronoi region, we proceed as follows. 
First, observe that for all y E [qj, w„] , we have already established that J{y) > Vyj 67 holds (where qi 
is the smallest point of nvor(v/)). Since \v\ — q\\ = — vz+i |, we havey(^/) > r^,/67, 

and we continue as above. 

Finally, the equation (7.2) is separable; elementary calculus yields 

f^o dv 

t{y) = 6lj ^. 

Jy fy 

□ 

Let J be a point on £„q, and let c be a critical point which influences wq; as before, let p be the 
orthogonal projection of /(c) onto and let x denote the distance between /(c) and p. 

For each y and a fixed critical point c, we also define the angle Ay, which is the angle that the 
segment from y to /(c) makes with the segment between /(c) and p. Notice that ry = |/(c) —y\. 
As before, use 6c to denote the angle between /(c) and iwQ- See Figure 7.2. 




Figure 7.2. The quantities y, ry, p, x. Ay, and dc- 



We now define the following function, related to Cost(3;i,3;2)- 

£ Jl, J2,C = log — — . 

\{y2-p) + ryj 
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By virtue of Lemma 7.3, if 3^1 and are both in vor (/(c)) , we have 
(7.3) Cost(3;i,3;2) < 67 / ^ = ^1 £{yx,y2,c). 

However, f will still be useful even when one or both of its first two arguments are not in vor (/(c) ) . 
We establish some bounds on the value of £ in the next few lemmas. 

Lemma 7.4. 



ry + {y-p) < 



X 



V3 if Ay < f 



Proof. Note that ry + {y — p) = x{tanAy + secA^). If Ay > n/6, we have ;c(tanAy + secA^) < 
3x tanAy = 3{y — p). When Ay < n/6, note that tanA^ + sec Ay is increasing in Ay.; at Ay = n/6, 

ry + {y-p)= xVs. 

We remark that this holds even if p < 0. □ 

Lemma 7.5. Letyi,y2 G i^^ with > 3^2 > 3/? > 0. Then 

yi 9 

£{yhy2,c) < log — +iog-. 

yi 4 

Proof. We consider two cases: when the angle Ay is large and when it is small. 
If Ayj < n/6, since > p 

£{yuy2,c) < £(yi,p,c) < log = logV3, 

where we have used Lemma 7.4 in the second inequality. 
If Ayj > n/6, we have (using Lemma 7.4 again) 

w 3(ji-p) 3yi{l-p/yi) 

£{yuy2,c)<\og- =log— — -— . 

2{y2-p) 2y2{l-p/y2) 

Since 3^2 > 3p, we have (1 — p/yi)/{i — p/3'2) < 3/2, and so 

yi 9 
£{yhy2,c) <log — +log-. 

y2 4 

Since -\/3 < 9/4, the above bound holds in either case. □ 
Lemma 7.6. Ifp>0, 

4^(1 n ^^1 4 + tan|0,| 
f(3p,0,c) <log- 



sec |0c| — 1 

We note that since p > 0, we have -n/2 < 9,- < n/2. Consequently, > 1. 

Proof. We have 

ri. n \ 1 (3p-p) + r3p 2p + (2p + ptan|0,|) 4 + tan|0,| 

£{3p, 0, c) = log < log — = log - 



rQ — p p?,Qc\Qc\—p sec|0c| — 1 



□ 
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Finally, we handle the case where \ Gc\ > 71 /2. 
Lemma 7.7. Ifyi >y2>0>p, £{yuy2,c) < \og{yi/y2). 

Proof. Observe that r^^ > yi — P, since ry^ is the hypotenuse of the right triangle with a leg of 
length y2 — p. Also, by the triangle inequality, r^j — ry^ < yi —y2- 
Using this, we have 

ryi + {yi-p) ^ {ry2+yi-y2) + {yi-p) 

ry2 + {y2-p) ~ 2{y2-p) 

_ 2y\-p + ry^-y2 



< 



2(j2-p) 
2{yi-p) + ry2~{y2-p) 
2{y2-p) 

< y^~p < Zi 

~ yi-p yi 

Consequently, f (3;i,3;2,c) = log ^''^j^^^^j < \og{yi/y2) as desired. 



□ 



We can now prove the main result of this section. 

Proof of Proposition 7.1. First, divide ^vi-o into segments where it intersects each of the j8(zo) 
Voronoi regions vor(vy) ; the j* segment will be bounded by points qj-i and qj (we set qo = wq, 
and qp^zo) ^ Figure 7.1. 

Now, we have 

j8(zo) j8(zo) 
(7.4) N = Cost{wo,WN) = Cost(^y_i,^^) < 67 £{qj^i,qj,Cj), 

where the inequality follows from Lemma 7.3 and (7.3). Applying Lemmas 7.5 and 7.6 gives us 



j8+(zo) iS+(zo) 

E £iQj-uqj.cj) < £ log^ 

7=1 7=1 



qj-l 



9 /^^(;o) 4 + tan|0;| 
+ /3+(zo)log-+ ^ log- 



7=1 



sec 1 0/1 — 1 



where ^* = max(|^y|, |3pj|). 

Note that since q* > \qj\, replacing q*- with qj will still give us an upper bound; furthermore, 
since \qj-i \ > \qj\, the logarithm of their ratio is positive. Thus, we have 



(7.5) £ilj-uqj.Cj) < log 

7=1 7=1 



1j 



+ /3+(zo)log-+ £ log 



4 + tan 1 0; I 



;ti sec|0^|-l 



Now we apply Lemma 7.7 to the remaining intervals (if any). 
(7.6) 



)6(zo) j8(zo) 

£iGj-hGj,Cj) < £ log 

;=/3+(;o)+l 7=iS+('o)+l 



^7 



Combining equations (7.5) and (7.6) with (7.4) and recalling that qo = wq, qp = wn gives the 
desired result. □ 
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8. The Average Cost 

In this section we shall prove our Main Theorem (Thm. 1.1), which follows from averaging the 
bound in Proposition 7.1 over the starting values on the circle of radius r = 1+C/d. 

Recall from Definition 4.6 that y is the set of pairs (?,c) for which the critical points c e 
influence the starting values zo = re" on the initial circle of radius r, J^; is the set of critical points 
which influence a given t, and J^c are the f G 5r which are influenced by c. 

For each pair in (?,c) G J^, we use Q = 0{t,c) to denote the angle between [0,f{re^^")] and 
[0,/(c)],thatis 

f(re^^it\ 
e(,,.)^Arg^L_). 

In the notation of section 7, 0{t,Cj) = Oj where Vj = f{cj) and {t^cj) E y. 

Note that for each fixed c, J^c is a collection of finitely many intervals: J^c consists of for those 
t such that £f{re") intersects Vor(/(c)) . 

Define for every c E'^f the function 0c : — ^ M by 

e,{t) = e{t,c)=Argl^^. 
Lemma 8.1. For each c E '^f, the map dc is at most {mc + 1) -to-one. 

Proof. For every 9 E {—7t, n] there are at most (mc + 1) rays £ C y for which the angle between 
[0,/(c)] and n{i) is 9 and which also intersect Vor(/(c)). This is a consequence of Proposi- 
tion 4.4. □ 

As an immediate consequence of Lemma 5.1, we have 
(8.1) 2nd-^< ^9c{t) <2nd- ^ 



r+l ~ dt ' ' ~ r - 1 



Proposition 8.2. Let f E ^(1) be of degree d and r > 1. Then 

4 + tan|0(.,c)| ^^^3 ^+1 
,t^, ^sec|0(r,c)|-l - r 

\e{t,c)\<7:/2 



Proof. Througout the proof, let ^^{9) = j • From Lemma 8.1 and (8.1), we see that for fixed 

values of c, we have 

r /■^/2 d9 r+l r^/^ 

/ logV/(0,(O)^^?<(m,+ l) / logv/(0)-— <(m, + l)— - / logV/(0)J0. 
J J -nil 9'it) 2nrdJ-Tt/2 



\ec(t)\<Ki2 
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Thus 

/ E logxif {e{t,c))dt= I log xir{e{t,c))dt 

2d -2 r+l 

< — 9.2901 

~" 27td r 



□ 



Recall that /3 + (z) denotes the number of critical points that influence the orbit of z = f^^^it ^]^]^ 
the critical value in the same half -plane, i.e., 

/3 + (re2'^'^) =card{c G I -Tt/2 < 9{t,c) < Tt/2}. 

The next proposition bounds the number of such Voronoi domains a starting value encounters, on 
average. 

Proposition 8.3. 

1+r 



Jo 



r 



Proof. Note that 

jjHre^ndt = f £ ldt=l^ J ^^^^ Idt. 

•^^ ■^^ ce-ft ce'^f-^ \ec(t)<7t/2 

\ecit)\<7t/2 ■' 

As in the proof of Proposition 8.2, we transport the calculation from the source space to the target 
space using the bound on 0^.{^) in (8.1) and the fact that for fixed c, 6c{t) is at most {lUc + l)-to-one 
(Lemma 8.1). This gives us 

['p^re'ndt< I r^'^< £K + l)^.;r<2(J-l)^<^. 
Jo ^^^J-K/2e',{t) ^.^^ 2Krd 2rd r 

Above, we used the fact that £ mc = J — 1. □ 



Lemma 8.4. //r > 1 + i 

/■^ Iwol 1 l+r , 

/ log ^ dt<d\ogr + log 87 + - ■ —— ■ logi^f . 
Jo \wn\ d r 
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Proof. Proposition 5.9, Proposition 5.12, Lemma 5.1, and Lemma 6.8 are used in the following 
calculation. 



/ log-j -dt= / \og\wQ\dt— / \og\wN\dt 

Jo \wm\ Jo Jo 

<dlogr- / log-hdt 
Jo o7 



1 1 +r 

< Jlogr + log87+ E llogPd 



d r 



1 1+r , 

< Jlogr + log87 + - logi^f 

d r 



□ 



1 \wq\ 2 

i?emarfc 8.5. If r = 1 + ^, then (ilogr < 1, giving / logr -J? < 1 +log87 + — logA'/- . 

Jo \wn\ d 



Now we are ready to provide a proof of the main theorem. 



Proof of Theorem 1.1. Let r = I + l/d. Proposition 7.1, Proposition 8.3, Lemma 8.4, and Propo- 
sition 8.2 imply 

Wf= ['#fire^^'')dt 



< / 67- 



logf^ + /3 + (re2-0log^+ £ log 



\WN 



\e{t.c)\<7t/2 



4 + tan|0(f,c)| 
sec|0(r,c)| - 1 



dt 



<67 
<67- 



;i+log87 + -|log/:/|) + 1.622 + 6 



13.1 + -|logi^^| 



□ 



9. Concluding Remarks 

(1) Our goal in this work was to bound the number of iterations of the a-step algorithm, rather 
than to optimize the arithmetic complexity. Since each step of the algorithm requires com- 
puting of all of the derivatives of /, one could use a higher-order method instead of New- 
ton's method (see [K88], [Ho], [SS86]) in the algorithm without a significant increase in 
cost, setting Zn+i = Tk{f{zn) — Wn+i), the truncation of the Taylor series. Use of such a 
method would result in a larger stepsize (and consequently fewer steps). 

(2) Alternatively, the use of a could be curtailed (or even entirely removed) by dynamically 
adjusting the guide points w„ as follows. At each step, set w„+i to be (1 — hn)\f{zn)\w. 
Initially, take h,i = Hq, but if f{zn) is not sufficiently close to Wn+i, divide h„ by 2 and try 
again until it is. At the next step, start with hn+i = min(/zo,2/z„). Note that this approach. 
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while similar in spirit, is somewhat different from the variable stepsize methods explored 
in [HS]. One can still use a to detect whether an approximate zero has been located, or, if 
evaluating higher derivatives of / is impractical, other methods such as those in [B02] or 
[O] can be used. 

(3) The ideas used in this paper can be adapted to those used in [KS] to locate approximate 
zeros for all d of the roots of /. That work uses a path-lifting method, but initial points are 
taken much further outside the unit circle, and a fixed stepsize is taken. In order to take the 
initial points on the circle of radius 1 + l/J, one needs to apply Lemma 5.1 to ensure that 
the initial points are properly spaced in the target space. 

(4) Using some of the ideas in [GLSY], one should be able to extend these results to deal with 
multiple roots or root clusters, for which Kf becomes arbitrarily large. 
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